Distant Speech Recognition in a Smart Home: Comparison of Several Multisource ASRs in Realistic Conditions
نویسندگان
چکیده
While the smart home domain has become a major field of application of ICT to improve support and wellness of people in loss of autonomy, speech technology in smart home has, comparatively to other ICTs, received limited attention. This paper presents the SWEET-HOME project whose aim is to make it possible for frail persons to control their domestic environment through voice interfaces. Several state-of-the-art and novel ASR techniques were evaluated on realistic data acquired in a multiroom smart home. This distant speech French corpus was recorded with 21 speakers playing scenarios including activities of daily living in a smart home equipped with several microphones. Techniques acting at the decoding stage and using a priori knowledge such as DDA give better results (WER=8.8%, Domotic F-measure=96.8%) than the baseline (WER=18.3%, Domotic F-measure=89.2%) and other approaches.
منابع مشابه
Reconnaissance d'ordres domotiques en conditions bruitées pour l'assistance à domicile (Recognition of Voice Commands by Multisource ASR and Noise Cancellation in a Smart Home Environment) [in French]
In this paper, we present a multisource ASR system to detect home automation orders in various everyday listening conditions in a realistic home. The system is based on a state of the art noise cancellation stage that feeds recently introduced ASR techniques. The evaluation was conducted on a realistic noisy dataset acquired in a smart home where a microphone was placed near the noise source an...
متن کاملThe Sweet-Home speech and multimodal corpus for home automation interaction
Ambient Assisted Living aims at enhancing the quality of life of older and disabled people at home thanks to Smart Homes and Home Automation. However, many studies do not include tests in real settings, because data collection in this domain is very expensive and challenging and because of the few available data sets. The SWEET-HOME multimodal corpus is a dataset recorded in realistic condition...
متن کاملIntegrated Fuzzy Control of Temperature, Light and Emergency Conditions for Smart Home Application
Smart home is composed of several controllers with different plants in control. If each controller works independently, without considering the mutual effect of the others in the control process, the whole system could definitely not converge to an optimum desired status and may not ever reach the demanded condition. The function of different controller system may has conflict In some condition...
متن کاملSpecial issue on speech separation and recognition in multisource environments
One of the chief difficulties of building distant-microphone speech recognition systems for use in `everyday' applications is that the noise background is typically `multisource'. A speech recognition system designed to operate in a family home, for example, must contend with competing noise from televisions and radios, children playing, vacuum cleaners, and outdoors noises from open windows. D...
متن کاملOn Distant Speech Recognition for Home Automation
In the framework of Ambient Assisted Living, home automation may be a solution for helping elderly people living alone at home. This study is part of the Sweet-Home project which aims at developing a new home automation system based on voice command to improve support and well-being of people in loss of autonomy. The goal of the study is vocal order recognition with a focus on two aspects: dist...
متن کامل